Regression with n→1 by Expert Knowledge Elicitation

نویسندگان

  • Marta Soare
  • Muhammad Ammad-ud-din
  • Samuel Kaski
چکیده

We consider regression under the “extremely small n large p” condition. In particular, we focus on problems with so small sample sizes n compared to the dimensionality p, even n → 1, that predictors cannot be estimated without prior knowledge. Furthermore, we assume all prior knowledge that can be automatically extracted from databases has already been taken into account. This setup occurs in personalized medicine, for instance, when predicting treatment outcomes for an individual patient based on noisy high-dimensional genomics data. A remaining source of information is expert knowledge which has received relatively little attention in recent years. We formulate the inference problem of asking expert feedback on features on a budget, present experimental results for two setups: “small n” and “n=1 with similar data available”, and derive conditions under which the elicitation strategy is optimal. Experiments on simulated experts, both on simulated and genomics data, demonstrate that the proposed strategy can drastically improve prediction accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Probabilistic Expert Knowledge Elicitation of Feature Relevances in Sparse Linear Regression

In this extended abstract1, we consider the “small n, large p” prediction problem, where the number of available samples n is much smaller compared to the number of covariates p. This challenging setting is common for multiple applications, such as precision medicine, where obtaining additional samples can be extremely costly or even impossible. Extensive research effort has recently been dedic...

متن کامل

Elicitator: An expert elicitation tool for regression in ecology

Elicitator : an expert elicitation tool for regression in ecology. Abstract. Expert elicitation is the process of retrieving and quantifying expert knowledge in a particular domain. Such information is of particular value when the empirical data is expensive, limited or unreliable. This paper describes a new software tool, called Elicitator, which assists in quantifying expert knowledge in a fo...

متن کامل

Designing Elicitor: Software to Graphically Elicit Expert Priors for Logistic Regression Models in Ecology. 2006

ELICITOR is graphical elicitation software created to elicit normal prior distributions for a Bayesian logistic regression model. Motivated by a real need to include expert knowledge in presence–absence models in ecology, this research describes a synthesis of theory from statistics, psychology and ecology. The aim was to build elicitation software that would be user friendly to environmental s...

متن کامل

Eliciting expert knowledge in conservation science.

Expert knowledge is used widely in the science and practice of conservation because of the complexity of problems, relative lack of data, and the imminent nature of many conservation decisions. Expert knowledge is substantive information on a particular topic that is not widely known by others. An expert is someone who holds this knowledge and who is often deferred to in its interpretation. We ...

متن کامل

Knowledge Elicitation for Design Task Sequencing Knowledge

There are many types of knowledge involved in producing a design (the process of specifying a description of an artifact that satisfies a collection of constraints [Brown, 1992]). Of these, one of the most crucial is the design plan: the sequence of steps taken to create the design (or a portion of the design). A number of knowledge elicitation methods can be used to obtain this knowledge from ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016